ON NEAREST NEIGHBOR DEGENERACIES OF INDISTINGUISHABLE PARTICLES

Benjamin  Kedem 
ABSTRACT 

Arrangement degeneracies suggested by sufficient statistics
associated with binary stationary m'th order Markov chains are
discussed, and are shown to correspond to and generalize some
degeneracies arising when indistinguishable particles are placed on a
one-dimensional lattice with n compartments. From these statistics
it is possible to define an m'th order unit. The arrangement
degeneracy obtained from s 1's and n-s 0's so that lower
order units are placed in higher order units is solved. However,
the most general arrangement degeneracy associated with all the
sufficient statistics of a given order is difficult to obtain. For this
case only the 3rd order arrangement degeneracy is obtained, the
1st and 2nd orders being relatively simple. These results are
applied in determining the asymptotic distributions of rare events.
1. INTRODUCTION

In the statistical mechanical treatment of cooperative
phenomena such as magnetic spin, binary alloys, elasticity, etc.,
the total energy of interaction $E$ is given as a linear combination
of potential energies associated with certain arrangements of
indistinguishable particles on a one-dimensional lattice. For
example, if only occupied nearest neighbors and next nearest
neighbors of the types 11, 101, 111 are of interest, then


$$E = n_{11} V_{11} + n_{101} V_{101} + n_{111} V_{111}$$


where the V's stand for potential energy and the n's refer to
the frequency of occurrence of the neighbor type. Here 1 refers
to an occupied site while 0 refers to a vacancy. Of interest,
then, is the arrangement degeneracy associated with
the types of neighbors, that is, the number of binary sequences
of size n which satisfy certain restrictions (numbers of types).
Much attention has been given to these combinatorial problems by
McQuistan in a series of articles in which he considered both
simple and complex particles such as dumbbells; see, e.g., references
[3], [4], [5]. Parallel to the physical interpretation of arrangement
degeneracies there is a purely statistical one, in which
arrangement degeneracies are used in approximating important
distributions such as the distributions of crossings and upcrossings
of a fixed level by a stationary process and the distribution of
extremes in such processes. This has recently been dealt with
in [2].
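As a concrete illustration of an arrangement degeneracy, the following minimal Python sketch enumerates all binary sequences of a small length n and tallies how many of them share the same frequencies of the neighbor types 11, 101 and 111. The helper names `count_pattern` and `degeneracy_table` are hypothetical, and brute-force enumeration is only practical for small n.

```python
from collections import Counter
from itertools import product

def count_pattern(seq, pattern):
    """Number of (possibly overlapping) occurrences of `pattern` in `seq`."""
    k = len(pattern)
    return sum(1 for i in range(len(seq) - k + 1)
               if tuple(seq[i:i + k]) == tuple(pattern))

def degeneracy_table(n, patterns):
    """Map each tuple of pattern frequencies to the number of
    length-n binary sequences that realize it."""
    table = Counter()
    for seq in product((0, 1), repeat=n):
        key = tuple(count_pattern(seq, p) for p in patterns)
        table[key] += 1
    return table

# Keys are (n_11, n_101, n_111) for sequences of length 5.
table = degeneracy_table(5, [(1, 1), (1, 0, 1), (1, 1, 1)])
```

Here `table[(0, 0, 0)]` is the number of length-5 arrangements in which no two occupied sites are nearest or next nearest neighbors.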

The purpose of this article is to show the connection between
the purely statistical and statistical mechanical approaches in
regard to arrangement degeneracies by examining sufficient
statistics associated with m'th order Markov chains. At the
same time we shall extend some of McQuistan's results by introducing
higher order degeneracies whose usefulness will be demonstrated
in finding the asymptotic distribution of "rare" events.

2.  A  CONNECTION  BETWEEN  NEAREST  AND  NEXT  NEAREST  NEIGHBOR 
DEGENERACY  AND  SUFFICIENCY 

Let $\{X_t,\ t = 0, \pm 1, \ldots\}$ be a two state (0-1) stationary
Markov chain and consider a binary time series from the chain,
$X_1, \ldots, X_n$. The sufficient statistics associated with the chain
are [2]

$$S = \sum_{i=1}^{n} X_i, \qquad R_1 = \sum_{i=2}^{n} X_i X_{i-1}, \qquad H = X_1 + X_n.$$

In order to find the joint distribution of $S$, $R_1$, $H$ it is
necessary to determine the number of binary sequences $M_n(s, r_1, h)$
for which $S = s$, $R_1 = r_1$, $H = h$. This number is easily found
to be

$$M_n(s, r_1, h) = \binom{2}{h} \binom{s-1}{r_1} \binom{n-s-1}{s-r_1-h} \qquad (2)$$


and the number of sequences for which only the first two conditions
are satisfied is

$$M_n(s, r_1) = \sum_{h=0}^{2} M_n(s, r_1, h) = \binom{s-1}{r_1} \binom{n-s+1}{s-r_1} \qquad (3)$$

a result which was also obtained by McQuistan [4] for the degeneracy
of nearest neighbor pairs.
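The first order counts can be checked by direct enumeration for small n. The sketch below is illustrative only: the closed forms coded in `m_srh` and `m_sr` are the run-counting expressions $\binom{2}{h}\binom{s-1}{r_1}\binom{n-s-1}{s-r_1-h}$ and $\binom{s-1}{r_1}\binom{n-s+1}{s-r_1}$, which are taken here as an assumption about (2) and (3). The function names are hypothetical, and the comparison is restricted to 0 < s < n so that no convention for binomial coefficients with negative entries is needed.

```python
from collections import Counter
from itertools import product
from math import comb

def binom(a, b):
    """Binomial coefficient, taken as 0 outside 0 <= b <= a."""
    return comb(a, b) if 0 <= b <= a else 0

def first_order_stats(x):
    """Return (S, R1, H) for a binary tuple x."""
    s = sum(x)
    r1 = sum(x[i] * x[i - 1] for i in range(1, len(x)))
    h = x[0] + x[-1]
    return s, r1, h

def m_srh(n, s, r1, h):
    """Assumed closed form for (2), used for 0 < s < n."""
    return binom(2, h) * binom(s - 1, r1) * binom(n - s - 1, s - r1 - h)

def m_sr(n, s, r1):
    """Assumed closed form for (3)."""
    return binom(s - 1, r1) * binom(n - s + 1, s - r1)

def agrees_with_enumeration(n):
    """Compare both closed forms with brute-force counts for 0 < s < n."""
    counts = Counter(first_order_stats(x) for x in product((0, 1), repeat=n))
    for s in range(1, n):
        for r1 in range(n):
            total = 0
            for h in range(3):
                if m_srh(n, s, r1, h) != counts[(s, r1, h)]:
                    return False
                total += counts[(s, r1, h)]
            if m_sr(n, s, r1) != total:
                return False
    return True
```

For instance, `m_sr(5, 2, 0)` counts the arrangements of two particles on five sites with no occupied nearest neighbor pair.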

If the chain $\{X_t\}$ is of 2nd order then [2] the sufficient
statistics are $S$, $R_1$, $H$ together with

$$R_2 = \sum_{i=3}^{n} X_i X_{i-2}, \quad C = \sum_{i=3}^{n} X_i X_{i-1} X_{i-2}, \quad U = X_2 + X_{n-1}, \quad V = X_1 X_2 + X_{n-1} X_n.$$

Again the joint distribution of these statistics requires the knowledge
of the number of binary sequences $M_n(s, r_1, r_2, c, h, u, v)$ for which
$S = s, \ldots, V = v$. This was found [1] to be


$$M_n(s, r_1, r_2, c, h, u, v) = \binom{2}{\max(h,u)} \binom{\max(h,u)}{v} \binom{r_1-1}{c} \binom{s-r_1-h}{r_1-c-v} \binom{s-r_1-1}{r_2-c} \binom{n-2s+r_1+h-2}{s-r_1-r_2+c-h-u+v} \qquad (4)$$

with the convention $\binom{-1}{-1} = 1$ and where $(h, u, v)$ takes values
in {(0,0,0), (0,1,0), (0,2,0), (1,0,0), (2,0,0), (1,1,0), (1,1,1),
(2,1,1), (1,2,1), (2,2,2)}. It follows that the number of binary
sequences for which only the first four conditions are fixed is
obtained by summing over $(h, u, v)$. We have
$$\begin{aligned}
M_n(s, r_1, r_2, c) &= \sum_{(h,u,v)} M_n(s, r_1, r_2, c, h, u, v) \\
&= \sum_{(0,0,0),(0,1,0),(0,2,0)} + \sum_{(1,0,0),(2,0,0),(1,1,0)} + \sum_{(1,1,1),(2,1,1),(1,2,1)} + M_n(s, r_1, r_2, c, 2, 2, 2).
\end{aligned}$$


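Upon noting that $r_1 = n_{11}$, $c = n_{111}$, $r_2 - c = n_{101}$, where
$n_{ij \cdots k}$ refers to the frequency of $ij \cdots k$ in the binary
sequence, we recognize the next nearest neighbor degeneracy
obtained by McQuistan [4]:

$$M_n(s, r_1, r_2, c) = \binom{r_1-1}{c} \binom{s-r_1}{r_1-c} \binom{s-r_1-1}{r_2-c} \binom{n-2s+r_1+2}{s-r_1-r_2+c} \qquad (5)$$

$$M_n(s, n_{11}, n_{111}, n_{101}) = \binom{n_{11}-1}{n_{111}} \binom{s-n_{11}}{n_{11}-n_{111}} \binom{s-n_{11}-1}{n_{101}} \binom{n-2s+n_{11}+2}{s-n_{11}-n_{101}} \qquad (5')$$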
Obviously (5') can be obtained from (5) by linearity. Here, and
in what follows, we use the notation $M_n(c_1, \ldots, c_k)$ to denote
the number of binary sequences for which $c_1, \ldots, c_k$ are fixed.
It is readily seen that (5) reduces to (3) by summing over $r_2 - c$
and $c$.
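The next nearest neighbor degeneracy can also be checked by enumeration. The sketch below assumes the closed form $\binom{r_1-1}{c}\binom{s-r_1}{r_1-c}\binom{s-r_1-1}{r_2-c}\binom{n-2s+r_1+2}{s-r_1-r_2+c}$ for $M_n(s, r_1, r_2, c)$, which is what run counting gives. The function names are hypothetical, and `binom` adopts the working convention $\binom{a}{0} = 1$ for every integer a, which is an assumption of this sketch. The comparison is made only on combinations of statistics that actually occur.

```python
from collections import Counter
from itertools import product
from math import comb

def binom(a, b):
    """Binomial coefficient with binom(a, 0) = 1 for every integer a
    and 0 whenever b < 0 or b > a (a working convention for this sketch)."""
    if b == 0:
        return 1
    return comb(a, b) if 0 < b <= a else 0

def second_order_stats(x):
    """Return (S, R1, R2, C) for a binary tuple x."""
    n = len(x)
    s = sum(x)
    r1 = sum(x[i] * x[i - 1] for i in range(1, n))
    r2 = sum(x[i] * x[i - 2] for i in range(2, n))
    c = sum(x[i] * x[i - 1] * x[i - 2] for i in range(2, n))
    return s, r1, r2, c

def m_second_order(n, s, r1, r2, c):
    """Assumed closed form for the count with S, R1, R2, C fixed."""
    units2 = s - r1 - r2 + c  # number of 2nd order units
    return (binom(r1 - 1, c) * binom(s - r1, r1 - c)
            * binom(s - r1 - 1, r2 - c)
            * binom(n - 2 * s + r1 + 2, units2))

def matches_enumeration(n):
    """True if the closed form equals the brute-force count on every
    combination (s, r1, r2, c) that occurs among length-n sequences."""
    counts = Counter(second_order_stats(x) for x in product((0, 1), repeat=n))
    return all(m_second_order(n, *key) == cnt for key, cnt in counts.items())
```

For example, `m_second_order(6, 2, 0, 1, 0)` counts the placements of a single 101 block on six sites.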


3.  HIGHER  ORDER  DEGENERACIES 

In the previous section we illustrated via two examples
that arrangement degeneracies may be viewed as special cases of
counting problems associated with sufficient statistics in Markov
chains. The next thing which comes to mind is the question of
generalizations. It is quite clear now that if one wants to look
into or define higher degeneracies, one should examine the sufficient
statistics, and linear functions thereof, of higher order two state
Markov chains. By the very definition of sufficient statistics,
it is intuitively clear that every conceivable arrangement
degeneracy can be obtained by summing over the arrangement
degeneracy of the sufficient statistics of a given order.

We need not consider all the sufficient statistics associated
with a given order but only those which define desired neighbor
types. To make this point clear we shall defer the general counting
problem to the next section while concentrating here on
straightforward generalizations of (3) and (5). We shall first illustrate
this claim by examining sufficient statistics associated with
3rd order chains. In this connection the notion of a "unit" is
useful.
DEFINITION: An m'th order unit is a binary sequence which starts
with a 1, ends with m separating 0's (if needed to separate
it from other units), and in which each 0-run, if not an end run,
consists of at most m-1 0's. Thus a unit is a block made of 0-runs
and 1-runs where the length of the 0-runs is restricted while the
length of the 1-runs is unrestricted.

DEFINITION: A free 0 is a 0 which does not belong to any unit.

Observe  that  an  end  unit  is  a  unit  which  does  not  need 
the  separating  0's  at  the  end  in  order  to  be  recognized.  The 
notion  of  a  unit  is  useful  as  it  determines  the  general  form  of 
a  binary  series  which  satisfies  our  predetermined  conditions. 
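The two definitions can be made concrete with a short sketch that splits a binary sequence into its m'th order units and counts the free 0's. The function name `split_units` is hypothetical, and each unit is returned without its m separating 0's, which is a representation chosen here for convenience. Following the remark on end units, the last unit carries no separating 0's, so any trailing 0's are counted as free.

```python
def split_units(x, m):
    """Split the binary sequence x into m'th order units.

    Returns (cores, free_zeros): `cores` lists each unit from its first 1
    to its last 1 (separating 0's omitted), and `free_zeros` is the number
    of 0's that belong to no unit.
    """
    ones = [i for i, bit in enumerate(x) if bit == 1]
    if not ones:
        return [], len(x)
    cores = []
    start = prev = ones[0]
    for pos in ones[1:]:
        if pos - prev - 1 >= m:  # m or more 0's in between: a new unit starts
            cores.append(tuple(x[start:prev + 1]))
            start = pos
        prev = pos
    cores.append(tuple(x[start:prev + 1]))
    zeros_total = len(x) - len(ones)
    zeros_inside = sum(core.count(0) for core in cores)
    # every unit except the last one owns m separating 0's
    free_zeros = zeros_total - zeros_inside - m * (len(cores) - 1)
    return cores, free_zeros
```

For the sequence 0101000110 and m = 2 this gives the two units 101 and 11 and three free 0's: the leading 0, one of the three 0's between the units, and the trailing 0.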

For example, consider a first order chain and suppose it is desired
to count the number of sequences for which $S$ and $R_1$ are fixed
at $s$ and $r_1$. There are $s - r_1$ first order units and
$(n-s) - (s-r_1-1) = n - 2s + r_1 + 1$ free 0's, which we permute in

$$\binom{n-s+1}{s-r_1}$$

ways. Next we place a 0 at the end of each first order unit
in one way. This determines the form of the binary series. Finally
distribute the s 1's in the $s - r_1$ units so that none is empty in

$$\binom{s-1}{s-r_1-1} = \binom{s-1}{r_1}$$

ways. This yields (3). (5) can be obtained in the same
way by first determining the positions of the 2nd order units and
then placing the 1st order units in the 2nd order units. This is
followed by the distribution of the s 1's. This procedure gives
rise to an immediate extension of (3) and (5).
Consider the 3rd order analog of (3) and (5). The statistics
of interest are $S$, $R_1$, $R_2$, $C$, together with
$K = \sum_{i=4}^{n} X_i X_{i-1} X_{i-2} X_{i-3}$
and $n_{1001}$, the frequency of 1001 in the sequence. These are
sufficient statistics (not all of them!) associated with 3rd order
chains. We shall determine $M_n(s, r_1, r_2, c, k, n_{1001})$.

There are $(s - r_1) - (r_2 - c) - n_{1001}$ 3rd order units,
$(s - r_1) - (r_2 - c)$ 2nd order units and $s - r_1$ 1st order units.

First permute the 3rd order units with the free 0's in

$$\binom{n-3s+2r_1+r_2-c+3}{s-r_1-r_2+c-n_{1001}}$$

ways. Next place the 2nd order units in the 3rd order units,
leaving none empty, in

$$\binom{s-r_1-r_2+c-1}{n_{1001}}$$

ways. Then place the 1st order units in the 2nd order units in

$$\binom{s-r_1-1}{r_2-c}$$

ways and place the separating 0's in the respective units in one
way. Put one 1 in each 1st order unit in one way. There remain
$r_1$ 1's. There are $r_1 - c$ 1st order units with two or more 1's.
So choose $r_1 - c$ 1st order units from $s - r_1$ and put one 1 in
each in $\binom{s-r_1}{r_1-c}$ ways. There are $c - k$ 1st order units with 3
or more 1's. So select $c - k$ 1st order units from $r_1 - c$ and put
one 1 in each in $\binom{r_1-c}{c-k}$ ways. Finally place the remaining k 1's
in the $c - k$ 1st order units, allowing "empty" units, in
$\binom{c-1}{k}$ ways. Whence
$$M_n(s, r_1, r_2, c, k, n_{1001}) = \binom{c-1}{k} \binom{r_1-c}{c-k} \binom{s-r_1}{r_1-c} \binom{s-r_1-1}{r_2-c} \binom{s-r_1-r_2+c-1}{n_{1001}} \binom{n-3s+2r_1+r_2-c+3}{s-r_1-r_2+c-n_{1001}} \qquad (6)$$


and by summing over $n_{1001}$ we immediately obtain

$$M_n(s, r_1, r_2, c, k) = \binom{c-1}{k} \binom{r_1-c}{c-k} \binom{s-r_1}{r_1-c} \binom{s-r_1-1}{r_2-c} \binom{n-2s+r_1+2}{s-r_1-r_2+c}.$$
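The 3rd order count and its sum over $n_{1001}$ can be compared with brute-force enumeration for small n. In the sketch below the two products of binomial coefficients follow the unit construction described above (permute 3rd order units with free 0's, nest lower order units, then distribute the 1's). The function names are hypothetical, and `binom` uses the working convention $\binom{a}{0} = 1$ for every integer a, an assumption made here so that sequences with few 1's are handled. Only combinations of statistics that actually occur are compared.

```python
from collections import Counter
from itertools import product
from math import comb

def binom(a, b):
    """binom(a, 0) = 1 for every integer a; 0 when b < 0 or b > a."""
    if b == 0:
        return 1
    return comb(a, b) if 0 < b <= a else 0

def freq(x, pattern):
    """Frequency of `pattern` (a tuple) in the binary tuple x."""
    k = len(pattern)
    return sum(1 for i in range(len(x) - k + 1) if x[i:i + k] == pattern)

def stats(x):
    """Return (s, r1, r2, c, k, n1001), using r2 = n111 + n101."""
    s, r1 = sum(x), freq(x, (1, 1))
    c, k = freq(x, (1, 1, 1)), freq(x, (1, 1, 1, 1))
    r2 = c + freq(x, (1, 0, 1))
    return s, r1, r2, c, k, freq(x, (1, 0, 0, 1))

def m6(n, s, r1, r2, c, k, n1001):
    """Product of binomials for the count with n1001 fixed."""
    u2 = s - r1 - r2 + c  # number of 2nd order units
    return (binom(c - 1, k) * binom(r1 - c, c - k) * binom(s - r1, r1 - c)
            * binom(s - r1 - 1, r2 - c) * binom(u2 - 1, n1001)
            * binom(n - 3 * s + 2 * r1 + r2 - c + 3, u2 - n1001))

def m6_summed(n, s, r1, r2, c, k):
    """Product of binomials after summing over n1001."""
    return (binom(c - 1, k) * binom(r1 - c, c - k) * binom(s - r1, r1 - c)
            * binom(s - r1 - 1, r2 - c)
            * binom(n - 2 * s + r1 + 2, s - r1 - r2 + c))

def check(n):
    """Compare both expressions with enumeration over all length-n sequences."""
    counts = Counter(stats(x) for x in product((0, 1), repeat=n))
    marginal = Counter()
    for key, cnt in counts.items():
        if m6(n, *key) != cnt:
            return False
        marginal[key[:5]] += cnt
    return all(m6_summed(n, *key) == cnt for key, cnt in marginal.items())
```

As a small case, `m6(8, 2, 0, 0, 0, 0, 1)` counts the positions of a single 1001 block on eight sites.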


This last expression yields (5) upon summation over k. We can
rewrite (6) in a form which resembles (5'). Let $fz_3$ denote
the number of free 0's associated with a third order degeneracy,
that is, the number of 0's which do not belong to any 3rd order
unit. Then

$$fz_3 = (n-s) - 3[(s-n_{11}) - n_{101} - n_{1001} - 1] - 2n_{1001} - n_{101}$$

and (6) becomes


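$$M_n(s, n_{11}, n_{111}, n_{1111}, n_{101}, n_{1001}) = \binom{n_{111}-1}{n_{1111}} \binom{n_{11}-n_{111}}{n_{111}-n_{1111}} \binom{s-n_{11}}{n_{11}-n_{111}} \binom{s-n_{11}-1}{n_{101}} \binom{s-n_{11}-n_{101}-1}{n_{1001}} \binom{fz_3+s-n_{11}-n_{101}-n_{1001}}{s-n_{11}-n_{101}-n_{1001}} \qquad (6')$$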


From (3), (5'), (6') we see that a pattern of arrangement
degeneracies begins to emerge whereby the highest order units are
permuted with the free 0's, then lower order units are placed in
higher order units, successively, then the separating 0's are
placed in one way, this followed by the distribution of 1's so
that $s, n_{11}, \ldots, n_{1(m+1)}, n_{101}, \ldots, n_{10(m-1)1}$ are preserved. In order
to arrive at the general result suggested by the above scheme we
shall employ the notation $*(m)$ to mean a $*$-run of length m.
Then we readily have


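$$\begin{aligned}
M_n(s, n_{11}, n_{111}, \ldots, n_{1(m+1)}, n_{101}, n_{1001}, \ldots, n_{10(m-1)1})
&= \binom{n_{1(m)}-1}{n_{1(m+1)}} \binom{n_{1(m-1)}-n_{1(m)}}{n_{1(m)}-n_{1(m+1)}} \cdots \binom{n_{11}-n_{111}}{n_{111}-n_{1111}} \binom{s-n_{11}}{n_{11}-n_{111}} \\
&\quad \times \binom{s-n_{11}-1}{n_{101}} \binom{s-n_{11}-n_{101}-1}{n_{1001}} \cdots \binom{s-n_{11}-n_{101}-\cdots-n_{10(m-2)1}-1}{n_{10(m-1)1}} \\
&\quad \times \binom{fz_m+s-n_{11}-n_{101}-\cdots-n_{10(m-1)1}}{s-n_{11}-n_{101}-\cdots-n_{10(m-1)1}}
\end{aligned} \qquad (7)$$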


where $fz_m$ stands for the number of free 0's and is given by

$$fz_m = (n-s) - m[(s-n_{11}) - n_{101} - n_{1001} - \cdots - n_{10(m-1)1} - 1] - n_{101} - 2n_{1001} - \cdots - (m-1)\,n_{10(m-1)1}. \qquad (8)$$

Observe that the number of m'th order units is equal to

$$(s - n_{11}) - n_{101} - n_{1001} - \cdots - n_{10(m-1)1}$$

since we essentially evaluate $\sum X_i (1 - X_{i-1}) \cdots (1 - X_{i-m})$. From (7)
we obtain (3), (5'), (6') as special cases.
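The general scheme can be written as a short function that builds the product of binomial coefficients for any order m from the two lists of frequencies. This is a sketch under stated assumptions: the function names and the argument layout (`ones` for $n_{11}, \ldots, n_{1(m+1)}$ and `zeros` for $n_{101}, \ldots, n_{10(m-1)1}$) are hypothetical, and `binom` uses the working convention $\binom{a}{0} = 1$ for every integer a.

```python
from math import comb

def binom(a, b):
    """binom(a, 0) = 1 for every integer a; 0 when b < 0 or b > a."""
    if b == 0:
        return 1
    return comb(a, b) if 0 < b <= a else 0

def free_zeros(n, s, n11, zeros):
    """Number of free 0's fz_m, with zeros = [n101, n1001, ...], m = len(zeros) + 1."""
    m = len(zeros) + 1
    units = s - n11 - sum(zeros)  # number of m'th order units
    weighted = sum(j * z for j, z in enumerate(zeros, start=1))
    return (n - s) - m * (units - 1) - weighted

def general_degeneracy(n, s, ones, zeros):
    """Product of binomials of the general scheme.

    ones  = [n11, n111, ..., n_{1(m+1)}]   (length m)
    zeros = [n101, n1001, ..., n_{10(m-1)1}] (length m - 1)
    """
    m = len(ones)
    if len(zeros) != m - 1:
        raise ValueError("zeros must have exactly one entry fewer than ones")
    levels = [s] + list(ones)  # levels[j] is the frequency of a run of j + 1 ones
    total = binom(levels[m - 1] - 1, levels[m])
    for j in range(1, m):
        total *= binom(levels[j - 1] - levels[j], levels[j] - levels[j + 1])
    units = s - ones[0]  # number of 1st order units
    for z in zeros:
        total *= binom(units - 1, z)  # nest units of one order in the next
        units -= z
    fz = free_zeros(n, s, ones[0], zeros)
    return total * binom(fz + units, units)
```

With m = 1 the function reduces to the nearest neighbor count, for example `general_degeneracy(5, 2, [0], [])`, and with m = 2 or m = 3 it reproduces the next nearest neighbor and 3rd order counts.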
4. THE GENERAL ARRANGEMENT DEGENERACY OF MARKOV CHAINS AND ITS
APPLICATION TO THE DISTRIBUTION OF RARE EVENTS

In the previous section we dealt with one way of extending
the next nearest neighbor degeneracy. However, this is only a
special case of the general arrangement degeneracy associated
with an m'th order Markov chain, where the problem is to count the
number of binary sequences for which all the sufficient statistics
are fixed. This is a challenging problem for which no general
solution exists as far as the present author knows. The main
difficulty is the fact that not all the low order units can
simply be placed in higher order units as in the previous section,
since some conditions are violated. Clearly, if such a result
were available, numerous arrangement degeneracies could be deduced
from it. This solution will not be attempted here. However, we shall
give the solution for 3rd order chains, and this will give us a
clue as to the general behavior of rare events in binary Markov
chains.

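The sufficient statistics associated with a stationary 3rd
order Markov chain, apart from ends statistics, are $S$, $R_1$, $R_2$,
$C$, $K$ as above together with

$$R_3 = \sum_{i=4}^{n} X_i X_{i-3}, \qquad R_{12} = \sum_{i=4}^{n} X_i X_{i-1} X_{i-3}, \qquad R_{21} = \sum_{i=4}^{n} X_i X_{i-2} X_{i-3}.$$

Note that $R_3 - R_{12} - R_{21} + K = \sum_{i=4}^{n} X_i (1 - X_{i-1})(1 - X_{i-2}) X_{i-3} = n_{1001}$.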


Also define the statistics

$A_{11-11}$ = # of 2nd order units which start and end with 11.
$A_{1-1}$ = # of 2nd order units which start and end with 1.
$A_{11-1}$ = # of 2nd order units which start with 11 and end with 1.
$A_{1-11}$ = # of 2nd order units which start with 1 and end with 11.
$A_{11}$ = # of 2nd order units which contain exactly two consecutive 1's.
$A_1$ = # of 2nd order units which contain exactly one 1.

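Clearly

$$\begin{aligned}
A_{11-11} + A_{11-1} + A_{1-11} + A_{1-1} + A_{11} + A_1 &= (S - R_1) - (R_2 - C) \\
A_{11} + A_{11-11} + A_{11-1} &= (R_1 - C) - (R_{12} - K) \\
A_{11} + A_{11-11} + A_{1-11} &= (R_1 - C) - (R_{21} - K),
\end{aligned} \qquad (11)$$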

which means that if $A_1$, $A_{11}$, and $A_{11-11}$ are known then so
are $A_{1-11}$, $A_{11-1}$ and $A_{1-1}$. As it turns out our problem is
simplified greatly if $A_1$, $A_{11}$, $A_{11-11}$ are added to the other
eight conditions. The reason for introducing the A's is that it
is difficult to keep track of $R_{12}$ and $R_{21}$, while from (11)
it is seen that when the A's are fixed in addition to $S$, $R_1$,
$R_2$, $C$, $K$, so are $R_{12}$, $R_{21}$. When the arrangement degeneracy
is obtained the A's can be removed by summation.
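The relations between the A's and the sufficient statistics can be checked by enumeration. The sketch below rests on an interpretation that is an assumption of this example, not a statement of the text: a unit "starts with 11" when its first 1-run has two or more 1's and "starts with 1" when that run is a single 1 (similarly for the end), $A_{11}$ counts units made of a single 1-run with two or more 1's, and $A_1$ counts units made of a single 1. All function names are hypothetical.

```python
from collections import Counter
from itertools import product

def second_order_cores(x):
    """2nd order units of the binary tuple x, without their separating 0's."""
    ones = [i for i, bit in enumerate(x) if bit == 1]
    if not ones:
        return []
    cores, start, prev = [], ones[0], ones[0]
    for pos in ones[1:]:
        if pos - prev - 1 >= 2:  # two or more 0's: the next unit starts
            cores.append(x[start:prev + 1])
            start = pos
        prev = pos
    cores.append(x[start:prev + 1])
    return cores

def unit_type(core):
    """Label a unit as '11-11', '11-1', '1-11', '1-1', '11' or '1'."""
    runs = [len(r) for r in "".join(map(str, core)).split("0") if r]
    head = "11" if runs[0] >= 2 else "1"
    tail = "11" if runs[-1] >= 2 else "1"
    return head if len(runs) == 1 else head + "-" + tail

def identities_hold(x):
    """Check n1001 = R3 - R12 - R21 + K and the three relations for the A's."""
    n = len(x)
    s = sum(x)
    r1 = sum(x[i] * x[i - 1] for i in range(1, n))
    r2 = sum(x[i] * x[i - 2] for i in range(2, n))
    c = sum(x[i] * x[i - 1] * x[i - 2] for i in range(2, n))
    r3 = sum(x[i] * x[i - 3] for i in range(3, n))
    r12 = sum(x[i] * x[i - 1] * x[i - 3] for i in range(3, n))
    r21 = sum(x[i] * x[i - 2] * x[i - 3] for i in range(3, n))
    k = sum(x[i] * x[i - 1] * x[i - 2] * x[i - 3] for i in range(3, n))
    n1001 = sum(1 for i in range(3, n) if x[i - 3:i + 1] == (1, 0, 0, 1))
    a = Counter(unit_type(core) for core in second_order_cores(x))
    return (r3 - r12 - r21 + k == n1001
            and sum(a.values()) == (s - r1) - (r2 - c)
            and a["11"] + a["11-11"] + a["11-1"] == (r1 - c) - (r12 - k)
            and a["11"] + a["11-11"] + a["1-11"] == (r1 - c) - (r21 - k))

all_hold = all(identities_hold(x) for x in product((0, 1), repeat=9))
```

Under this reading the relations hold for every binary sequence of length 9, which is the sense in which fixing the A's together with $S$, $R_1$, $R_2$, $C$, $K$ also fixes $R_{12}$ and $R_{21}$.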
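Recall that there are

$$(s - r_1) - (r_2 - c) - (r_3 - r_{12} - r_{21} + k)$$

3rd order units and

$$(n-s) - 3[(s-r_1) - (r_2-c) - (r_3-r_{12}-r_{21}+k) - 1] - 2(r_3-r_{12}-r_{21}+k) - (r_2-c)$$

free 0's. As in the previous section, permute the 3rd order units
with the free 0's and place the 2nd order units in the 3rd order
units, leaving none empty, in

$$\binom{(n-s) - 2(s-r_1) + (r_2-c) + 3}{(s-r_1) - (r_2-c) - (r_3-r_{12}-r_{21}+k)} \binom{(s-r_1) - (r_2-c) - 1}{r_3-r_{12}-r_{21}+k}$$

ways. Place the separating 0's in one way. Now, we cannot
place the 1st order units in the 2nd order units as we did in the
previous section, as $R_{12}$, $R_{21}$ change. This is precisely why we
need the A's. So according to $A_1$, $A_{11}$, $A_{11-1}$, $A_{1-11}$, $A_{11-11}$,
$A_{1-1}$ assign "types" to the 2nd order units. That is, let $A_{11-11}$
2nd order units start and end with 11, $A_{11-1}$ 2nd order units start
with 11 and end with 1, etc. This can be done in

$$\binom{(s-r_1) - (r_2-c)}{A_1,\ A_{11},\ A_{11-11},\ A_{11-1},\ A_{1-11},\ A_{1-1}}$$

ways. There are now $r_2 - c$ 0's left which we distribute in
$A_{11-11} + A_{11-1} + A_{1-11} + A_{1-1}$ 2nd order units "nonempty" in

$$\binom{r_2-c-1}{A_{11-11}+A_{11-1}+A_{1-11}+A_{1-1}-1} = \binom{r_2-c-1}{(s-r_1)-(r_2-c)-(A_{11}+A_1)-1}$$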

ways. This takes care of the 0's and the 1st order units as well!
It remains now to distribute the 1's. According to the type
assignment place 11 and 1 in the 2nd order units as needed in one
way, and then put 1 in every empty 1st order unit (1-cell or
1-run) in one way. There are $r_1 - c$ 1st order units with two
or more 1's. But we have already $A_{11-1} + A_{1-11} + 2A_{11-11} + A_{11}$
1st order units with two 1's. So select
$(r_1 - c) - (A_{11-1} + A_{1-11} + 2A_{11-11} + A_{11})$ 1st order units from
$(s - r_1) - (A_{11-1} + A_{1-11} + 2A_{11-11} + A_{11}) - (A_{11-1} + A_{1-11} + 2A_{1-1} + A_1)$
1st order units in

$$\binom{(s-r_1) - (A_{11-1}+A_{1-11}+2A_{11-11}+A_{11}) - (A_{11-1}+A_{1-11}+2A_{1-1}+A_1)}{(r_1-c) - (A_{11-1}+A_{1-11}+2A_{11-11}+A_{11})}$$

ways and put one 1 in each. We now have $r_1 - c$ 1st order units
with exactly two 1's, since previously no 1st order unit was
empty. There are $c - k$ 1st order units with 3 or more 1's. From
the $r_1 - c$ 1st order units which contain exactly 2 1's select
$c - k$ units and put one 1 in each in

$$\binom{r_1-c}{c-k}$$

ways. We now have $c - k$ 1st order units with exactly 3 1's.
Finally, there remain k 1's which we place in these $c - k$ 1st
order units, allowing "empty" units, in

$$\binom{c-k+k-1}{c-k-1} = \binom{c-1}{k}$$

ways. It follows that

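$$\begin{aligned}
M_n(s, r_1, r_2, c, r_3, r_{12}, r_{21}, k, A_{11}, A_1, A_{11-11})
&= \binom{c-1}{k} \binom{r_1-c}{c-k} \binom{(s-r_1) - 2(A_{11-11}+A_{11-1}+A_{1-11}+A_{1-1}) - (A_{11}+A_1)}{(r_1-c) - (A_{11-1}+A_{1-11}+2A_{11-11}+A_{11})} \\
&\quad \times \binom{r_2-c-1}{(s-r_1)-(r_2-c)-(A_{11}+A_1)-1} \\
&\quad \times \binom{(s-r_1)-(r_2-c)}{A_1,\ A_{11},\ A_{11-11},\ r_1-c-r_{12}+k-A_{11}-A_{11-11},\ r_1-c-r_{21}+k-A_{11}-A_{11-11},\ A_{1-1}} \\
&\quad \times \binom{(s-r_1)-(r_2-c)-1}{r_3-r_{12}-r_{21}+k} \binom{(n-s)-2(s-r_1)+(r_2-c)+3}{(s-r_1)-(r_2-c)-(r_3-r_{12}-r_{21}+k)}
\end{aligned} \qquad (12)$$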


where $A_{1-1}$ is obtained from (11) in terms of $A_{11}$, $A_1$,
$A_{11-11}$, $s$, $r_1$, $r_2$, $c$, $r_{12}$, $r_{21}$, $k$, and is too long to write.
Thus the desired arrangement degeneracy is obtained by summing
over $A_{11}$, $A_{11-11}$, $A_1$, adhering to the convention $\binom{-1}{-1} = 1$.

We have
where $p_{x_i x_{i-1} x_{i-2} x_{i-3}} = \Pr(X_i = x_i \mid X_{i-1} = x_{i-1}, X_{i-2} = x_{i-2}, X_{i-3} = x_{i-3})$,
from which it follows that the joint distribution of $S$, $R_1$, $R_2$,
$C$, $R_3$, $R_{12}$, $R_{21}$, $K$, $X_1$, $X_2$, $X_3$, $X_{n-2}$, $X_{n-1}$, $X_n$ is the
product of (15) and the arrangement degeneracy of these statistics.
But when $X_1$, $X_2$, $X_3$, $X_{n-2}$, $X_{n-1}$, $X_n$ are equal to 0 this
arrangement degeneracy is asymptotically, as $n \to \infty$, the same as
(13). Therefore from (13), (14) and (15)

$$\Pr(S=s, R_1=0, R_2=0, \ldots, K=0, X_1=X_2=X_3=X_{n-2}=X_{n-1}=X_n=0) \sim \binom{n-3s+3}{s}\, p_{1000}^{s}\, p_{0001}^{s}\, p_{0100}^{s}\, p_{0010}^{s}\, p_{0000}^{\,n-4s-3}. \qquad (16)$$


Assume that as $n \to \infty$

(i) The 1's become rare, separated by long 0-runs, so that the
event $\{R_1 = 0, R_2 = 0, \ldots, K = 0, X_1 = X_2 = X_3 = X_{n-2} = X_{n-1} = X_n = 0\}$
becomes a sure event.

(ii) $p_{1000} \to 0$ such that $n p_{1000} = a$, fixed.

(iii) $p_{0001} \sim p_{0100} \sim p_{0010} \sim p_{0000} \sim 1 - p_{1000}$.


Then from (16), as n becomes large,

$$\Pr(S = s) \sim \frac{(n-3s+3)!}{(n-4s+3)!\, s!} \left(\frac{a}{n}\right)^{s} \left(1 - \frac{a}{n}\right)^{n-s-3},$$

a result which is well expected [2].
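The behavior of this expression for large n can be examined numerically. The sketch below evaluates the finite-n expression and compares it with the Poisson probability $e^{-a} a^s / s!$, which is the limit one would expect for rare events. The function names and the sample values n = 100, n = 10^6, s = 3, a = 2 are illustrative assumptions.

```python
from math import comb, exp, factorial

def approx_pr(n, s, a):
    """Finite-n expression (n-3s+3)! / ((n-4s+3)! s!) * (a/n)^s * (1 - a/n)^(n-s-3)."""
    p = a / n
    return comb(n - 3 * s + 3, s) * p ** s * (1 - p) ** (n - s - 3)

def poisson_pmf(s, a):
    """Poisson probability with parameter a."""
    return exp(-a) * a ** s / factorial(s)

def gap(n, s, a):
    """Absolute difference between the finite-n expression and the Poisson value."""
    return abs(approx_pr(n, s, a) - poisson_pmf(s, a))

small_n_gap = gap(100, 3, 2.0)
large_n_gap = gap(10 ** 6, 3, 2.0)
```

The gap shrinks as n grows with a held fixed, which is consistent with S being asymptotically Poisson with parameter a.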

This procedure for finding the asymptotic distribution of S
as $n \to \infty$ can be easily extended to the m'th order case. Arguing
as above we have from (7) (as the last binomial coefficient in the
most general case is given by the last coefficient in (7), which
stands for the number of permutations of the m'th order units with
the free 0's)


where $n \to \infty$ and $p_{100 \cdots 0} \to 0$ such that $n p_{100 \cdots 0} = a$.

More results of this nature can be obtained once the arrangement
degeneracy of a specific order is known. For example, from
(12) it should not be too difficult to show that under some conditions
similar to (i)-(iii), $S - R_1$ is also asymptotically Poisson, in
fact Poisson with parameter $a(1-\lambda)$, where $\lambda \in (0,1)$ is a measure
of the density, or clustering tendency, of rare events. For an
interpretation of this fact see [2].